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Abstract. We compare classes of finite relational structures via monadic second-order 
transductions. More precisely, we study the preorder where we set C C K if , and only if, 
there exists a transduction r such that C C t(K). If we only consider classes of incidence 
structures we can completely describe the resulting hierarchy. It is linear of order type 
uj+3. Each level can be characterised in terms of a suitable variant of tree- width. Canonical 
representatives of the various levels are: the class of all trees of height n, for each n 6 N, 
of all paths, of all trees, and of all grids. 



1. Introduction 

Monadic second-order logic (MSO) is one of the most expressive logics for which the 
theories of many interesting classes of structures are still decidable. In particular, the infi- 
nite binary tree and many linear orders have a decidable MSO-theory [Rab69[ IShe75| and 
the same holds for many classes of (finite or infinite) structures with bounded tree-width 
|BCL07[ IRS86| . Furthermore, for every fixed MSO-sentence ip and every class C of finite 
structures with bounded tree-width, there is a linear-time algorithm that checks whether 
a given structure from C satisfies ip |Bod96l IFG06) . Examples of monadic second-order ex- 
pressible graph properties are /c-colourability, various types of connectivity, and planarity 
(via Kuratowski's well-known characterisation by forbidden configurations). 

A variant of monadic second-order logic called guarded second-order logic (GSO) allows 
quantification not only over sets of elements but also over sets of edges (i.e., tuples from 
the relations) |GHO02] . The above mentioned linear-time algorithms can be adapted to this 
logic. There are tight links between guarded second-order logic and tree- width: every class of 
(finite or infinite) relational structures with a decidable GSO-theory has bounded tree- width. 
This gives a sort of converse to the above mentioned decidability results |See911 Cou95j . The 
proof of this result uses a deep theorem of graph minor theory by Robertson and Seymour : 
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a set of graphs has bounded tree-width if and only if it excludes some planar graph as a 
minor |RS86| . 

To compare the MSO-theories or GSO-theories of two classes of structures we can use 
monadic second-order transductions, a certain kind of interpretations suitable both, for 
monadic second-order logic and, using a detour via incidence structures, also for guarded 
second-order logic |BCL07l ICoulJTl ICou951 ICou03] . 

In the present article we classify classes of finite structures according to their 'combi- 
natorial complexity'. (Note that we do not consider decidability issues.) We will consider 
two ways to measure the complexity of such classes. On the one hand, we can use their 
tree-width and its variants. On the other hand, we can compare them via transductions. As 
it turns out, these two approaches are equivalent and they give rise to the same hierarchy. 
This indicates the robustness of our definitions and their intrinsic interest. Other possible 
hierarchies, based on different logics, will be briefly mentioned in Section [9l 

Let us give more details. An MSO-transduction is a transformation of relational struc- 
tures specified by monadic second-order formulae. As graphs can be represented by rela- 
tional structures, we can use MSO-transductions as transformations between graphs. An 
MSO-transduction is a generalisation of the following kind of operations (see Definition 13. 4p : 

(i) the definition of a relational structure "inside" another one (in model theory this is 
called an interpretation) ; 

(ii) the replacement of a structure 21 by the union of a fixed number of disjoint copies 
of 21, augmented with appropriate relations between the copies; 

(iii) the expansion of a given structure 21 by a fixed number of unary predicates, called 
parameters. Usually, these predicates are arbitrary subsets of the domain, but we also may 
have a formula imposing restrictions on them. 

Because of the possibility to use parameters, a transduction r is a many-valued map in 
general. (We may also think of it as non-deterministic.) Each relational structure 21 has 
several images r(2l, Pi, ... , P n ) depending on the choice of the parameters Pi, . . . , P n C A. 
If OS = t(21, Pi, ... , P n ) we can consider the tuple Pi, . . . , P n as an encoding of *B in 21. The 
transduction r is the corresponding decoding function. 

Each transduction r extends in a canonical way to a transformation between classes of 
structures. If C and K, are classes of relational structures with C C r(/C), we can think of r 
as a way of encoding the structures in C by elements of /C. For instance, every finite graph 
can be encoded in a sufficiently large finite square grid (by a fixed transduction r). Every 
finite tree of height at most n (for fixed n) can be encoded in a sufficiently long finite path. 
But it is not the case that all finite trees can be encoded by paths (by a single transduction). 

The purpose of this article is to classify classes of finite relational structures according 
to their encoding powers. We will compare classes C and /C of structures by the following 
preorder : 

C Q IC : iff C C r(/C) for some MSO-transduction r . 

We attack the problem of determining the structure of this preorder. Since, at the moment, 
a complete description of this hierarchy seems to be out of reach, we concentrate on a variant 
where we replace monadic second-order logic by guarded second-order logic. In this case, the 
corresponding hierarchy can be described completely. To obtain a corresponding notion of 
transduction we cannot simply change the definition of an MSO-transduction to use GSO- 
formulae since the resulting notion of transduction would not yield a reduction between 
GSO-theories, and it even would not be closed under composition. Instead, we will take 
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a detour by combining ordinary MSO-transductions with a well-known translation between 
GSO and MSO. 

This translation is based on incidence structures. Let us first describe this notion for 
undirected graphs where it is very natural. There are two canonical ways to encode a 
graph by a relational structure. We can use its adjacency representation which is a 
structure (V, edg) where the domain V consists of all vertices of <3 and edg is a binary relation 
containing all pairs of adjacent vertices. But we also can use the incidence representation 
of (5. This is the structure {V U E, in) where the domain V U E contains both, the vertices 
and the edges of (3, and in is the incidence relation between vertices and edges. In a similar 
way, we can associate with every relational structure 21 its incidence structure 2lj n (see 
Definition 12 . 1 [) where the domain also contains elements for all tuples in some relation of 21. 

It is shown in |GHO02| that every GSO-formula tp talking about some structure 21 can 
be translated into an MSO-formula talking about the incidence structure 2lj n , and vice versa. 
Hence, we can use incidence structures to obtain an analog ue l^fn of the preorder C suitable 
for guarded second-order logic. We set 

C Ein : iff C; n C r(/Ci n ) for some MSO-transduction r , 

where C m := { 2l; n | 21 € C }. The main result of the present article is a complete characterisa- 
tion of the resulting hierarchy for classes of finite structures. We show that the preorder Cj n 
is linear of order type oj + 3. It turns out that every class of finite structures is equivalent 
to one of the following classes, listed in increasing order of generality: 

• trees of height at most n, for each n £ N; 

• paths; 

• arbitrary trees (equivalently, binary trees) ; 

• (square) grids. 

Each of these levels can be characterised in terms of tree decompositions. Hence, we also 
obtain a corresponding hierarchy of complexity measures on structures that are compatible 
with MSO-transductions transforming incidence structures. 

The upper levels of the hierarchy can be determined easily using techniques from graph 
minor theory developed by Robertson and Seymour, such as the notions of a minor and a 
tree decomposition. In particular, we employ two results characterising bounded tree-width 
and bounded path- width in terms of excluded minors [RS83, RS86j. 

For the lower levels, which consist of classes of bounded path-width, the characterisation 
is more complicated and requires new results relating tree decompositions and monadic 
second-order logic. 

In Sections [2] and [3] we give basic definitions. Section |4] collects some known results 
from graph minor theory. We also introduce a new variant of tree-width and prove some 
of its basic properties. In Section [5] we expound the connections between tree- width and 
monadic second-order transductions. In Section [6] we introduce the transduction hierarchy 
and we state our main theorem. Its proof is contained in Sections [7] and [8j In the first one, 
we prove that the hierarchy is strict while, in the second one, we show that it covers every 
class. The final Section [9] contains some extension of our results to other logics and some 
open problems in this direction. 
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2. Preliminaries 

Let us fix our notation. We set [n] := {0, . . . , n — 1} and we write &{X) for the power 
set of a set X. We denote tuples a with a bar. The components of a will be do, • • • , a n -l 
where the length n will usually be implicit. We sometimes identify a tuple a with the set of 
its components. For instance, we write c £ a to express that c = dj, for some i. 

In this article all graphs, trees, and relational structures are finite. We will not repeat 
this finiteness assumption. A relational structure 21 is of the form (A, Rq, . . . , i?^ t _ 1 ) with 
domain A and relations Rf. The signature of such a structure is the set E = {Rq, . . . , R m -i} 
of relation symbols. In some proofs we will also use signatures with constant symbols 
denoting elements of the domain. We write ar(i?) for the arity of a relation R. For a 
signature U, we denote by STTRfZ 1 ] the class of all I7-structures. We write 21 © 23 for the 
disjoint union of the structures 21 and 03. 

We mainly consider incidence structures. These are representations of structures 21 
where we have added new elements to the domain, one for each tuple in the relations of 21. 

Definition 2.1. Let 21 = (A, Pq , . . . , be a structure and let r be the maximal arity 

of a relation Ri. The incidence structure of 21 is the structure 

2t in := (A U E, Pr,,, . . . , Pij m _! , in , . . . , in r _i) , 

where we extend the domain A by 

E := Rq U • • • U i? m _i , 

and the relations are 

P Ri :={c€E\c€Rf}, 

inj := {(a,c) G A x E \ \c\ > i and a = Ci } . 

The class of all incidence structures is STM^i;] := {2l in | 21 G STM[i7] }. 

Remark 2.2. Note that incidence structures are binary (i.e., their relations have arity at 
most 2). Hence, they can be regarded as bipartite labelled directed graphs. 

One important property of incidence structures is the fact that they are sparse, i.e., 
their relations contain few tuples. 

Definition 2.3. Let k € N. A structure 21 = (A,R) is A;-spars^| if, for every subset X C A 
and all relations Ri, we have 

\Ri n A ar ^^| <k-\X\. 

Lemma 2.4. Every incidence structure is 1-sparse. 

Let us fix our notation regarding trees and graphs. 

Definition 2.5. A directed graph is a pair (V, edg) where V is the set of vertices and 
edg C V x V is the edge relation. Thus, graphs are by definition simple (without parallel 
edges). An undirected graph is a graph where the edge relation edg is symmetric. When 
speaking of a graph we will always mean an undirected one. 

We regard a coloured graph as a relational structure {V, Eq, . . . , Ek,Po, ■ ■ ■ , P m ) with 
binary relations Ei and unary relations Pj that encode the colours of, respectively, the edges 
and the vertices. We allow graphs whose edges and vertices have several colours. 



^ ICouOS] introduced two notions of sparsity for hypergraphs: k-sparse hypergraphs and uniformly k-sparse 
hypergraphs. What we call fc-sparse is a slight modification of the uniform version of |Cou03| . 
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Trees play a major role in this article. Intuitively, a tree is a directed graph T with a 
unique vertex r of indegree 0, called the root of T, such that every vertex is reachable from r 
by a unique directed path. The actual definition we will use is slightly more concrete. It is 
based on the usual encoding of the vertices of a tree by finite sequences describing the path 
from the root to the given vertex. In fact, we introduce two notions of a tree: order-trees 
and successor-trees. The latter use the usual edge relation, while the former are equipped 
with the tree-order instead. 

Definition 2.6. Let D be a set. 

(a) For an ordinal a, we denote by D <a the set of all sequences of elements of D of 
length less than a. The prefix relation on D <u; is defined by 

x <y : iff y = xz , for some z G D <UJ . 

The infimum of x and y with respect to i.e., their longest common prefix, is denoted by 
x n y. 

(b) A finite prefix closed subset T C D <ul is called a tree domain. Following our intuition 
that a vertex is represented by the path leading to it, we call the empty sequence () the root 
of T and the maximal elements of T its leaves. The domain of the complete m-ary tree of 
height n is m <n . Hence, m <0 is the empty tree, m <l the one consisting only of the root, 
and m <2 consists of a root and m leaves. 

Given a tree domain T we can define the successor relation edg on T by setting 

(x, y) G edg : iff y = xd for some d G D . 

In this case we call y a successor of x and x the predecessor of y. A structure of the form 
(T, edg) (and every structure isomorphic to it) is called a successor-tree. 

Sometimes it is convenient to replace the successor relation edg by the tree order ■<. 
Structures of the form (T, ■<) are called order-trees. A coloured tree is the expansion of a 
(order- or successor-) tree by unary predicates P. (We do not require these predicates to 
be pairwise disjoint. Hence, every vertex may have none, one, or several colours.) We write 
TREE m for the class of all order-trees (T, ^, Po, . . . , P m ~i) with m colours. The set of leaves 
of a tree 1 is denoted by Lf (T) . 

(c) Let T = (T, ■<) be an order-tree. The level of an element v G T is the number of 
vertices u G T with u ~< v . We denote it by \v\. The height of 1 is the least ordinal a greater 
than the level of every element of T. Hence, the empty tree has height and the tree with 
a single vertex has height 1. The out-degree of T is the maximal number of successors of a 
vertex of X. For successor-trees we define these notions analogously. 

(d) Let 1 be a tree and v a vertex of X. The subtree of X rooted at v is the subtree X„ 
consisting of all vertices u with v ■< u, i.e., all vertices below v. 

Sometimes it is possible to reduce statements about relational structures to statements 
about graphs. One way to do so consists in replacing a structure by its Gaifman graph. 

Definition 2.7. The Gaifman graph of a structure 21 = (A, R) is the undirected graph 

Gf(2t) := (A edg), 
with the same domain A and with the edge relation 

edg := { (u, v) \ u ^ v and there is some c G Rf with u,v G c } . 
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3. Monadic second-order logic and transductions 

Monadic second-order logic (MSO) is the extension of first-order logic by set variables 
and quantifiers over such variables. An important variant of MSO is guarded second-order 
logic (GSO) where one can quantify not only over sets of elements but also over sets of tuples 
from the relations (see [GHO02] for details). Hence, guarded second-order logic over a given 
structure 21 is equivalent to monadic second-order logic over its incidence structure 2l; n . 

Lemma 3.1 ( [GHO02] ). 

(a) For every GSO-sentence <p, we can effectively construct an MSO-sentence ip such that 

21 |= <p iff 2lj n |= ip , for all structures 21 . 

(b) For every MSO-sentence ip, we can effectively construct a GSO-sentence ip such that 

2lin \= <p iff 21 |= ip , for all structures 21 . 

Throughout the article we will consistently work with incidence structures, thereby 
avoiding the treatment of guarded second-order logic. In particular, all formulae are tacitly 
assumed to be MSO-formulae. 

Besides MSO and GSO we also consider their counting extensions CMSO and CGSO. 
These add predicates of the form \X\ = k (mod m) to, respectively, MSO and GSO, where 
X is a set variable and k, m are numbers. All of our results for GSO go through also for 
CGSO, i.e., for CMSO-transductions between incidence structures. In Section[9]we will give a 
partial characterisation of the hierarchy for CMSO-transductions of graphs (not of incidence 
graphs). In this case the availability of counting predicates does make a difference. 

To state the composition theorem below it is of advantage to work with a variant of 
MSO without first-order variables. This variant has atomic formulae of the form X C.Y and 
RZ, for set variables X, Y, Zq, Z\, . . . , where a formula of the form RZ states that there are 
elements ctj E Zi such that the tuple a is in R. Note that every general monadic second-order 
formula with first-order variables can be brought into this restricted form by replacing all 
first-order variables by set variables and adding the condition that these sets are singletons. 

Whenever we speak of MSO we will have this version in mind. In particular, the follow- 
ing definition of the rank of a formula is based on this variant. When writing down concrete 
formulae, on the other hand, we will allow the use of first-order variables to improve read- 
ability. We regard every such formula as an abbreviation of a formula of the restricted 
form. Similarly, when we use structures with constants we actually regard each constant as 
a singleton set. 

Definition 3.2. (a) The rank qr(</?) of a formula ip is the nesting depth of quantifiers in p. 
Formulae of rank are called quantifier-free. 

(b) The monadic theory of rank m of a structure 21 is 

MTh m (2l) := { ip G MSO \2L\=<p, qr(p) < m } . 

For a tuple a of elements of 21, we also consider the monadic theory MTh m (2t, a) of the 
expansion (21, a). 

Remark 3.3. We use the term 'rank' instead of the more natural 'quantifier rank' since in 
Section [9] below we will consider CMSO where the notion of rank has to be adapted for our 
results to go through. 
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In order to compare the monadic theories of two classes of structures we employ MSO- 
transductions. To simplify the definition we introduce three simple operations and we obtain 
MSO-transductions as compositions of these. 

Definition 3.4. (a) Let k > 2 be a natural number. The operation copy fc maps a structure 21 
to the expansion 

copy fe (2l) := (21 © • • • 21, ~, P , ■ ■ ■ , Pk-l) 

of the disjoint union of k copies of 21 by the following relations. Denoting the copy of an 
element a G A in the i-th component of 21 © ••• 21 by the pair (a, i), we define 

Pi := { (a, i) | a G A } and (a, i) ~ (b, j) : iff a = b . 

For k = 1, we set copy 1 (2l) := 21. 

(b) For m G N, we define the operation exp m that maps a structure 21 to all possible 
expansions by m unary predicates Qq, . . . ,Q m -i C A. Note that this operation is many- 
valued and that exp is just the identity. 

(c) A basic MSO -transduction is a partial operation r on relational structures described 
by a list 

(x,S{x),ipo(x),...,ip s - 1 (x)) 

of MSO-formulae called the definition scheme of r. Given a structure 21 that satisfies the 
formula \ the operation r produces the structure 

r(2l) := (D,/Jo,...,fl s _i) 

where 

D := {a G A | 21 |= <5(a) } and Ri := { a € D ar ^ | 21 (= ^(o) } . 

If 21 ^ x then r(2l) remains undefined. 

(d) A k-copying MSO -transduction r is a (many-valued) operation on relational struc- 
tures of the form tq o copy fc o exp m where tq is a basic MSO-transduction. When the value 
of k does not matter, we will simply speak of a transduction. 

Note that, due to exp m , a structure can be mapped to several structures by a transduc- 
tion. Consequently, we consider r(2l) as the set of possible values (tq o copy fc )(2t, P) where 
P ranges over all m-tuples of subsets of A. 

For a class C, we set 

r(C) :=(J{t(3Q I 21 G C } . 

Remark 3.5. (a) The expansion by m unary predicates corresponds, in the terminology of 
[Cou95|, ICou03| . to using m parameters. We will use this terminology, for instance, in the 
proof of Theorem 15.51 

(b) Note that every basic MSO-transduction is a 1-copying MSO-transduction without 
parameters. 

Example 3.6. (a) Let U be a signature and let r be the maximal arity of a relation in S. 
The operation mapping an incidence structure 2ti n G §TMi n [I7] to the structure Gf(2t)i n is a 
/c-copying MSO-transduction where k = r(r — l)/2, for r > 2, and k = 1, for r < 1. 

(b) For every fixed number n G N, we describe a transduction r transforming a path *P 
of length Z into the class of all trees of height n with I + 1 vertices. 
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We can encode a tree T of height n with m vertices as a finite word w of length m over 
the alphabet [n] as follows. Let vq <\ cx • • • <i ex i> m -i be the enumeration of the vertices of T 
in lexicographic order, and let li be the level of V{. We encode T by the word w := Iq . . . l m —l- 
A transduction can recover T from w as follows. Each position in w corresponds to a vertex. 
The predecessor of the i-th vertex v is the maximal vertex to the left of v whose label is less 
than li. Clearly, this predecessor relation is definable in monadic second-order logic. 

The two most important properties of MSO-transductions are summarised in the follow- 
ing lemmas. 

Lemma 3.7. Let t be a transduction. For every MSO-sentence <p, there exists an MSO- 
sentence ip T such that, for all structures 21, 

21 (= tp T iff 53 ^ </> for some 53 € r(2l) . 

Furthermore, if r is quantifier- free, then the rank of ip T is no larger than that of (p. 

Corollary 3.8. For every quantifier-free transduction r and every m 6 N, there exists a 
function f T on monadic theories of rank m such that 

MTh m (r(2l)) = / T (MTh m (2l)) , for all structures 21 . 

Lemma 3.9 ( |Cou91| ). For all transductions a,T there exists a transduction g such that 
g = a o t. 

As a further example note that we can use transductions to translate between order-trees 
and successor-trees. 

Lemma 3.10. (a) There exists a transduction r mapping an order-tree to the corresponding 
successor-tree. 

(b) There exists a transduction a mapping a successor-tree to the corresponding order- 
tree. 

Similarly there are transductions translating between a structure and its incidence struc- 
ture. 

Lemma 3.11. For every signature U, there exists a transduction r such that r(2li n ) = 21, 
for all 21 G STM^]. 

The converse statement is a much deeper result and requires the structure in question 
to be fe-sparse for some fixed k. 

Theorem 3.12 ([Cou03, BlulO]). For every signature U and all numbers k € N, there exists 
an MSO -transduction r such that r(2t) = 2ti n , for all k-sparse structures 21 E §>TIR[I7]. 

We have seen in Lemma 13.71 that transductions relate the monadic theories of two 
structures. We also need techniques to relate the monadic theory of a structure to those of 
its substructures. The disjoint union operation can frequently be used for this purpose (for 
a proof of the following theorem see, e.g., Theorem 7.11 of [Lib04j, or [Cou87j). 

Theorem 3.13. Let U and r be relational signatures with constants. For every m E N, 
there exists a (computable) binary operation © m on monadic theories of rank m such that 

MTh m (2l 53) = MTh m (2l) m MTh m (5S) , 

for all E -structures 21 and T '-structures 53. 
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Below we will mainly make use of the following corollary. 

Lemma 3.14. Let T be an order-tree and v € T a vertex. Suppose that T' is the order-tree 
obtained from T by replacing the subtree T„ by some tree 6. Let c be a tuple of vertices 
of X with v ^ Ci, for all i. If a are vertices of % v and b are vertices of & such that 

MTh m (1 v ,a) = MTh m (6,6) 

then it follows that 

MTh m (T, ac) = MTh m (T',6c) . 

Proof. Let £ be the tree obtained from X by replacing the subtree % v by a single vertex w. 
We define the following auxiliary predicates: 

P := {w} , Qo := { x G C \ x -< w } , Q\ := T v , and Q[ := S . 

We construct a quantifier-free transduction r such that 

T((€,P,Q ,c)e(1 v ,Qi,a)) = {%ac), 

r((<t,P,Q ,c) (B (&,Q[,b)) = (<!', be) . 

If f T is the function from Corollary 13.81 and © m the operation from Theorem 13.131 it follows 
that 

MTh m (X',6c) = / T (MTh m (C,P,Qo,c) m MTh m (6, Q[, 6)) 

= / T (MTh OT (£, P, Q , c) 9 m MTh m (X 1 „ Qi,a)) = MTh m (X, ac) , 

as desired. 

Hence, it remains to define r. Let {^(, P,Qo,d} be the signature of (C,P,Qq,c), and 
{^jQije} the signature of (X„,Qi,a) and (&,Qi,b). For r we can use the basic MSO- 
transduction consisting of the following formulae : 

X ■= true , 
S(x) := ->Px , 

<p^(x,y) := x ^ y V (Q x A Qiy) . □ 

4. Minors and tree decompositions 

Some properties of the transduction hierarchy, which we will introduce in Section [6] 
below, can be deduced from results about graph minors. 

Definition 4.1. (a) Let & = (V, edg) be an undirected graph and E C edg a set of edges. 
We denote by E* the reflexive and transitive closure of E. Note that E* is an equivalence 
relation. The graph <&/E is obtained by contracting all edges in E. Formally, we have 

&/E:= (W,edg ), 

where W := V/E* is the set of equivalence classes and edg contains an edge between classes 
[x] and [y] if and only if [x] ^ [y] and there are representatives u € [x] and v E [y] with 
{u, v) G edg. 

(b) A minor of a graph <3 is a graph that can be obtained from <3 by first deleting 
some vertices and edges and then contracting some of the remaining edges. For a class C of 
graphs, we denote by Min(C) the class of all minors of graphs in C. 
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One central tool in graph minor theory is the notion of a tree decomposition and the 
related complexity measures called tree-width and path-width. These notions extend in a 
natural way to relational structures. 

Definition 4.2. Let 21 = (A,R) be a structure. 

(a) A tree decomposition of 21 is a family D = (U V ) V £T of (possibly empty) subsets 
U v C A indexed by a rooted tree T such that 

• for every element a € A, the set {v £ T \ a £ U v } is nonempty and connected in T; 

• for every tuple c G Ri, there is some index v G T with c^U v . 

We call the sets U v the components of the decomposition, and T is its underlying tree. 

The height of a tree decomposition D = {U v ) v& t is the height of T, while its width is 
the number 

wd(D) := sup([J7„| - 1) . 

(b) The tree-width twd(2t) of 21 is the minimal width of a tree decomposition of 21. 

(c) The path-width pwd(2l) of 21 is the minimal width of a tree decomposition of 21 where 
the underlying tree is a path. 

(d) The n-depth tree-width twd n (2l) of 21 is the minimal width of a tree decomposition 
of 21 whose underlying tree has height at most n. 

(e) For a class C of structures, we define twd(C) as the supremum of twd(2l), for 21 G C, 
and similarly for pwd(C) and twd n (C). 

Remark 4.3. (a) The n-depth tree-width of a graph <3 is related to its tree-depth td(0) 
as introduced by Nesetfil and Ossona de Mendez [NdM06a, NdMOGbJ. The tree-depth of 
a graph is the least number n such that some orientation of © is a subgraph of some 
order-tree of height n. For a graph (5, it follows that 

• td(<5) < n implies twd n (<3) < n; 

• twd n (0) < k implies td(<5) < nk. 

These facts are easy to establish. We will not need them in the following. 

(b) There are some simple relations between n-depth tree-width, path-width, and tree- 
width. For every graph (3, we have 

twd((S) < twd n+1 ((S) < twd„(<3) , for every n G N , 

and twd(<3) = twd n ((5) , for all sufficiently large n G N . 

Furthermore, 

pwd(<5) < n(twd n (<5) + 1) , for every n G N . 

(Let us sketch the proof of the last inequality: let (U V ) V £T be a tree decomposition of <3 of 
height n and width twd n ((5). As the components of a path decomposition of ($ we take all 
sets of the form U vo U • • • U U v . , where vq . . .Vk is a path from the root Vq to some leaf Vk 
of T.) 

The next lemma shows that most questions regarding tree decompositions of a structure 
can be reduced to the corresponding questions about its Gaifman graph. For many of the 
following results it is therefore sufficient to consider graphs. 

Lemma 4.4. Let 21 be a structure. A family (U v ) v ^t is a t re ^ decomposition of 21 if and 
only if it is a tree decomposition of Gf (21) . 
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Proof. (=>) is immediate. (<4=) follows from the fact that every tree decomposition of a 
clique has one component U v containing the whole clique. This implies that, for every 
clique C in Gf(2l) induced by some tuple c € Ri, there is some vertex v £ T with C C U v . 
Hence, every tuple c S Ri is contained in some component U v . □ 

In order to separate the higher classes of the hierarchy, we shall employ two deep results 
of Robertson and Seymour about excluded minors. 

Theorem 4.5 (Excluded Tree Theorem [RS83J). For each tree %, there exists a number 
k € N such that 

T £ Min((9) implies pwd(55) < k , for every graph & . 

Theorem 4.6 (Excluded Grid Theorem [RS86J). For each planar graph <£, there exists a 
number k € N such that 

<£. £ Min(©) implies twd((5) < k , for every graph . 

Corollary 4.7. (a) A class of graphs has bounded path-width if and only if it excludes some 
tree as a minor. 

(b) A class of graphs has bounded tree-width if and only if it excludes some planar graph 
as a minor. 

We also need a variant of these theorems for n-depth tree-width. The next lemma 
contains the main technical argument. 

Lemma 4.8. Suppose that & is a graph that does not contain a path of length I. Then <5 has 
a tree decomposition of height at most I and width at most I — 1. 

Proof. Let (T, ^) be a depth-first spanning order-tree of (5, i.e., a spanning tree such that, 
for every edge (u,v) of & we have u -< v or v ^ u (for details see, e.g., |Die06| where such 
spanning trees are called normal). We define a tree decomposition (U v ) v ^t °f ® by setting 

U v := {u € T | u ■< v} . 

Since T is depth-first, it follows that every edge (u,v) of & is contained in some compo- 
nent U w where w is the maximum of u and v. 

The height of the tree T can be at most I since contains no path of length I. Further- 
more, we have \U V \ = \v\ +1 < I. Hence, the width of the tree decomposition is at most 
I -I. □ 

Theorem 4.9 (Excluded Path Theorem). For each path there exist numbers n, k G N 
such that 

^ ^ Min(0) implies twd n ((5) < k , for every graph (3 . 

Proof. Suppose that *p ^ Min(<3) and let I be the length of Then the preceding lemma 
implies that twd;((5) < I. □ 
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Corollary 4.10. (a) A class of graphs has bounded n-depth tree-width, for some n, if and 
only if it excludes some path as a minor (equivalently, as a subgraph). 

(b) A class of graphs has bounded tree-depth if and only if it excludes some path as a 
minor (equivalently, as a subgraph). 

We can also compute a bound on the n-depth tree- width in terms of the (n + l)-depth 
tree- width. It will be needed in the proof of Theorem 18.21 below. 

We say that the tree (S, ^} can be embedded into a tree (T, ■<) if there exists an order- 
preserving injective mapping (S, ^) —¥ (T, i.e., if (S, regarded as relational structure, 
is isomorphic to an induced substructure of (T, ^). For instance, we have an embedding 



o o 




4 5 1 2 4 5 

as indicated by the labels. If S can be embedded in T then S is isomorphic to a minor of T, 
when we consider S and T as graphs. 

Definition 4.11. Let D = (U v ) ve x De a tree decomposition and let F be a set of edges of 
(the successor-tree corresponding to) T. The tree decomposition D/F obtained by contract- 
ing the edges in F is 

D/F := (U^)[ V ] €T / F , 

where U[ v] := \J ue[v] U u . 

Lemma 4.12. Let <5 be a graph and let D := (U v ) v ^t be a tree decomposition of (5 of 
width k and height at most n+l. IfmENis some number such that the tree m <n+1 cannot 
be embedded into T, then twd n ((S) < m(k + 1). 

Proof. We construct a tree decomposition D' of height at most n and width at most 
m{k + 1) — 1 as follows. Let P C T be the minimal (w.r.t. C) set of vertices that con- 
tains 

• every leaf of T at level n and 

• every vertex that has at least m successors in P. 

Since m <n+1 cannot be embedded into T it follows that P does not contain the root of T. 
Let F be the set of all edges of T linking a vertex in T \ P to a vertex in P. By definition 
of P it follows that (i) every vertex of T has less than m F-successors ; (ii) every path of T 
from the root to some leaf on level n contains at least one edge from F ; and (iii) no such 
path contains two consecutive edges from F . 

The decomposition D' := D/F obtained by contracting all edges in F has width at 
most 



k + 1 + (m - l)(/c + 1) - 1 < m(k + 1) 
Furthermore, the height of the underlying tree is at most n. 



□ 
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Corollary 4.13. Let C be a class such that k := twd n +i(C) < oo and let m G N. // every 
structure 21 G C has a tree decomposition {U v ) ve T of width k and height at most n + 1 such 
that the tree m <n+l cannot be embedded into T, then twd n (C) < m(k + 1) < oo. 

We conclude this section with a lemma that will be useful when constructing trans- 
ductions T n that transform a structure into their tree decompositions of height n. Our 
construction works for all tree decompositions that are strict in the following sense. 

Definition 4.14. Let (U v ) v ^t be a tree decomposition of a structure 21. 

(a) We define a function fi : A — >■ T by 

fi{a) := min {v G T \ a G U v } . 

Note that fi(a) is well-defined since, by the definition of a tree decomposition, there is 
at least one v G T such that a £ U v . 

(b) For v G T, we set 

U tu := (J \ |J U u . 

u>v u~<v 

(c) The tree decomposition (U v ) v is strict if, for every v G T, 

• f7„ fl / (equivalently, U V \U U ^ 0, where u is the predecessor of v) and 

• if v is not the root of T, then the subgraph of Gf(2l) induced by the set U^ v is 
connected. 

We conclude this section by a result implying that, for our purposes, it will be sufficient 
to consider only strict tree decompositions. 

Lemma 4.15. Let & be a graph. For every tree decomposition (U v ) v& t of&, there exists a 
strict tree decomposition (U^) v< =t' of& whose width and height are at most those of (U v ) v< =t- 

Proof. By induction on n € N, we will construct a sequence (U™) ve T n of tree decompositions 
such that U^ v is connected, for every v G T n with level < \v\ < n. (Recall that \v \ denotes 
the level of v, and the root is the only vertex of level 0.) Furthermore, the restriction of T n to 
the set of vertices of level at most n will coincide with the corresponding restriction of T n+ i, 
and we have = [/"", for all v G T n+ i with level \v\ < n. It will follow that the sequence 

has a limit {U£) ve T w where 

T u := (j {veT n | H <n} and U% := . 

nGN 

We start the construction with To := T and U® := U v . Suppose that we have already 
defined (J7")„ € t„- For every vertex v G T n of level \v\ = n + 1 we modify the tree de- 
composition as follows. Let Co, ... , C m _i be an enumeration of the connected components 
of U"!k v . We replace in T n the subtree rooted at v by m copies So, ... , S^n-i of the subtree, 
all attached to the predecessor of v. For u G Si we define U™ +1 := U™ fl Cj. We can do 
these modifications for all vertices of level n + 1 simultaneously. Let {U™ +1 )veT n+1 be the 
resulting tree decomposition. 

The limit (U£ ) v eT^ of this sequence satisfies the connectedness requirement of a strict 
tree decomposition. To also satisfy the other condition we proceed as follows. Let F be the 
set of all edges (n, v ) of T w such that U v fl /u(V) = 0. (Note that this implies U v C U u .) We 
construct the tree decomposition {U' v ) ve T' by contracting all edges in F. The details and 
the remaining verifications are left to the reader. □ 
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5. Tree decompositions and transductions 

In this section we relate the material presented in the preceding one to monadic second- 
order transductions. Let us start by showing that there is a transduction computing the 
minors of a graph. 

Lemma 5.1 (|Cou95j). There exists a transduction r such that r(®i n ) = Min((5), for every 
graph (5. 

Proof. A minor $j of & is obtained by deleting vertices, deleting edges, and contracting 
edges. Hence, we can encode ft by four sets: the set of vertices we delete, the set of edges 
we delete, the set of edges we contract, and a set of vertices containing one representative 
of each contracted subgraph of & (these vertices serve as vertices of the resulting graph Sj). 
With the help of these parameters we can define ft inside of 0; n by MSO-formulae. □ 

There is a close relationship between tree decompositions and transductions. 

Lemma 5.2. For every signature £ and every number k G N, there exists a transduction 
Tfc : TMEEo —7- STMi n [i7] that maps an order-tree T to the class of all incidence structures 2l; n 
such that the corresponding ^-structure 21 has a tree decomposition of width at most k with 
underlying tree T. 

Proof. Suppose that 21 is a structure which has a tree decomposition (U v ) v ^t of width k. 
We prove that 21 can be defined from a colouring of T where the number of colours depends 
only on £ and k. 

Let Co? ■ ■ ■ j ^m-l De an enumeration of all I7-structures whose domain is a subset of 
[k + 1]. For each !)£T, let il„ be the substructure of 21 induced by U v . It follows that, for 
every v G T, we can find some index X(v) such that ity = £\( v y Let ir v : ily — > be the 

corresponding isomorphism. 

Furthermore, we associate with each edge (u, v) of T the binary relation 

R(u,v) := {(7r„(a),7r v (a)) | a G U u n U v } C [k + 1] X [k + 1] . 

We can recover 21 from T with the help of the vertex colouring A and the edge colouring R. 
We form the disjoint union of all structures (£\( v ))im f° r v 6 T, and we identify two elements 
i € Cx( u ) and j S C\( v ) if (u, v) is an edge of T such that £ R(u,v). This can be 

performed by an n-copying MSO-transduction where n is the maximal size of the structures 
(£i) in , i < m. □ 

We have just seen that we can map a class of trees to a class of structures with these 
trees as tree decompositions. Conversely, if we only consider strict tree decompositions, we 
can define a transduction mapping a class of structures to the corresponding class of trees. 
Recall the function fi : A — > T from Definition 14.141 that assigns to an element a € A the 
minimal index v € T such that a G U v . 

Proposition 5.3. For each number n G N, there exists an MSO-formula ip n (x,y; Z) such 
that, for every strict tree decomposition D = (U v ) v ^t of a graph & of height at most n, there 
are sets Lq, . . . , L n -i Q V such that 

|= <p n (a,b;L) iff n(a) < (j,(b) . 
Proof. Given D we use the sets 

Li := {a G V \ \(J,(a)\ = i} 
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of all elements that first appear at level i of the tree. In particular, Lq = Uq is the root 
component of the tree decomposition. For k < n, let (5>^ be the subgraph of & induced by 
Lf. U • • • U L n —\. For a £ Li and b E Lj we define 

a <b iff i < j and a, b belong to the same connected component of 0>j , 

and a ~ b iff a ^b and b ■< a , or if a, 6 € Lo . 

Clearly, the relation < is MSO-defmable with the help of the parameters L. We claim that, 
for a, b $l Lq, we have 

a <b iff fi(a) < /i(6) . 

(<^=) Suppose that /u(a) < Then 6 € U^^ a y Furthermore, t^(a) is connected 

since D is strict. Hence, U^^ a -j is a connected component of <5>i containing both a and b. 
Since |/x(a)| < |/i(o)| it follows that a H 6. 

(=^>) Suppose that a <b. Then there exists an undirected path 7r in ©>| A1 ( a )| connecting 
a and 5. Since U^ u f) U^ v = 0, for all u ^ v such that \u\ = \v\, it follows that ir is contained 
in some U^ v such that \v\ = \fi(a)\. Since a is a vertex of n we must have v = n(a). 
Furthermore, b € £^t(a) since b is also a vertex of ir. This implies that /i(a) < □ 

Theorem 5.4. For eac/i constant n € N, i/iere exists a transduction T n mapping a graph <3 
to the class of all (underlying trees of) strict tree decompositions of & of height at most n. 

Proof. Let D = (U v ) v ^t be a strict tree decomposition of 0, and let ip n (x,y; L) be the 
formula from Proposition 15.31 with parameters Lq, . . . , L n _i C V . We can define the tree T 
underlying D as follows: 

• Its root is any element of Lq = Uq. 

• For the other vertices of T, we choose one vertex in each ~-class different from Lq. 
Note that ~ is definable with the help of ip n . 

• The ordering of T is defined by ip n . 

Hence, we obtain a transduction with parameters L that transforms a graph into a 'candidate' 
tree decomposition. Via a backwards translation we can write down a formula stating that 
the candidate given by the parameters L corresponds to an actual strict tree decomposition. 
We omit the details which are standard for this type of construction. □ 

In Lemma 15.21 we have seen how to obtain classes of bounded tree-width from classes 
of trees. Conversely, it is the case that every class obtained from a class of trees via a 
transduction has a bounded tree-width. 

Theorem 5.5. For every transduction r : TWEE m —■ STM[ n [U], there exists a number 
k € N such that, for each m- coloured tree T with image 2li n € t(T), the structure 21 has a 
tree decomposition of width at most k where the underlying tree is T. 

Remark 5.6. (a) Courcelle and Engelfriet |CE95j have shown that an incidence struc- 
ture 2t; n obtained via a transduction r from an m-coloured tree T has bounded tree-width. 
Theorem 15.51 strengthens this result by proving that, if 2ti n is the image of a tree T, then we 
can use the same tree T as the tree underlying a tree decomposition of the given width. 

(b) Lapoire has announced in |Lap98| a result somewhat related to Theorem 15.41 He 
claims that, for every k G N, there exists a transduction that transforms a given graph & of 
tree- width at most A; to a coloured tree (like in the proof of Lemma I5.2p that encodes some 
tree decomposition of of width at most k. Our result is less ambitious in the sense that 
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we only consider tree decompositions of a fixed height. This enables us to give a precise 
description of which tree decompositions (the strict ones) our transduction returns. Note 
that one can show that, for k > 2, there is no such transduction that would return all tree 
decompositions of (5 of width at most k. 

We split the proof into several lemmas. As a technical tool we introduce a second kind of 
hierarchical decompositions of structures and a corresponding notion of width. To simplify 
the definition we will only consider incidence structures. 

Definition 5.7. Let 2lj n = {A U E, P, ino, . . . } be an incidence structure. 

(a) A partition refinement of 2lj n is a family 77 = (W v , ~ v ) v <zt of pairs consisting of a 
subset W v C A U E and an equivalence relation ~.„ on W v with the following properties: 

• The index set T is a tree. 

• For the root (), we have W/\ = AU E 

• For every internal vertex (i.e., non-leaf) u € T with successors vo, . . . , v n -\, the sets 
W Vo , . . . , W Vn _ 1 form a partition of W u . 

• \W U \ = 1, for every leaf u € T. 

• x ~ v y and u ■< v implies x ~ u y. 

• If u is an internal vertex of T, v,w successors of n, not necessarily distinct, and 
x € W v , y G W w elements, then x ~ u y implies either 



Note that it follows that, for every element x £ A U E, there is some leaf u € T such that 



(b) The width of a partition refinement IJ = {W v -, Ki v)vET is the maximum number of 
equivalence classes realised in some component W v : 



The partition-width of the structure 2li n is the minimal width of a partition refinement of 2t; n . 

The notion of a partition refinement and of partition-width are adaptations of definitions 
from [Blu06, Blu03j. Up to a factor of 2, the partition- width of an incidence structure and 
its clique-width coincide. 

Example 5.8. Let 21 = (A, R) be a structure with domain A = {a, b, c, d, e} and a ternary 
relation 



or 



x, y € A and, for every e € E \ (W v U W w ) and every i 

(x, e) e m (y, e) £ m 
x,y € E and, for every a € A \ (W v U W w ) and every i 

(a, x) <E inj (a, y) G im, . 



W u = {x}. 



wd(II) := max\W v / 



V 



R = {(a, 6, c), (a, 6, d), (a, 6, e 



)}• 




Its incidence structure is 2t; n = (A U £?, Pr, ino, m i) i n 2) with 75 = {x,y,z}. We obtain a 
partition refinement 
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{a | b | c, d, e | x, y, z} 




{a | 6} {„|«i} { 2 |e} 

A A A A 

{a} {6} M {c} {j/} {d} {z} {e} 

where we have indicated the partition into «„-classes by vertical bars. This partition refine- 
ment has width 4. 

Lemma 5.9. For every partition refinement II = (W v , ~ V ) V £T °f an incidence structure 
2li n = {A U E, P, ino, . . . , in r _i), there exists a tree decomposition D = (A)^eT of 21 with 
the same underlying tree T such that 

wd(£>) < (r + 3) • wd(iT) . 

Proof. Let I : A U E — >■ Lf(T) be the function assigning to every x G A U E the unique 
leaf l(x) of T such that W^ x ) = {x}. We claim that the desired tree decomposition (A)ueT 
of 21 is given by 

U u := B u U C u U A, 
where 

B K :={tiGi|?i^ /(v) and (w, e) € in^ for some i < r and e £ E with n ^ /(e) } , 

C u := {v E A \ u ^ l(v) and (v , e) € im, for some A r and e £ E with n r< /(e) } , 

D u := { v G A | (v, e) G inj for some i < r and e£fi with /(u ) n /(e) = ii } . 

Note that the connectedness condition holds since (v , e) <G inj implies that t> belongs to 
precisely those components U u such that u lies on the path from l(v) to /(e). 

It remains to prove that \U U \ < (r + 3) • wdiT. If u = /(c), for some c £ E, then ?7 U = C u 
consists of the components of c. Hence, \U U \ = \c\ < r. Therefore, we may assume that 
u £ l[E]. Let 

[x] u := { y e W u | y « u x } 
denote the ~ u -class of x. We prove the following bounds. 

(1) \[x] u \ = 1, for all x £ B u . 

(2) \[x] u n U u \ < 2, for all x G D u . 

(3) \C u \<r-\W u /^ u \. 

Since B u , D u C W u it then follows that | U u \ = \B U U A U D u | < (r + 3) • |W U /« U |. 

(1) Let x G B u . There is some tuple e & E and some index i such that (x, e) G inj and 
u ^ 1(e). We have (y, e) G inj, for every y G W u such that y sa u x. Since x is the only such 
element it follows that [x] n = {x}. 

(2) Let x G D u . There is some tuple e G -E 1 and some i such that (x, e) G inj and 
u = /(x) l~l /(e). Let v be the successor of u such that t> r< /(e). We have (y, e) G inj, for all 
y G W M \ W„ such that y m u x. Hence, [x] u \ W v = {x}. 

Suppose that there is some element y G [x] u (~l W v (~l A- By definition of A there is 
some tuple f & E and some j such that (y, /) G inj and l(y) l~l /(/) ^ u. As above it follows 
that [x] u R W v = {y}. Consequently, we have \[x] u fl U u \ < 2. 

(3) Let x G A and consider some tuple e£fi such that (x,e) G inj and u ■< 1(e). Set 

Iu(e) '■= { z G A | (z, e) G inj for some z and u ^ Z(z) } . 
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For e, / G E n W u , it follows that 

e~«/ implies 7 M (e) = I u (f) . 

Furthermore, we obviously have |/ u (e)| < |e| < r. It follows that C u contains at most 
r • |W U /~ U | vertices. □ 

Lemma 5.10. Let r : TM£E m — > STMi n [i7] be a basic MSO -transduction such that, for 
every m-coloured order-tree T with image 2ti n € t(T), we have 

Al) E = Lf (1) and yl n E = . 

Then there exists a number n G N suc/i i/iat, for every order-tree T, we can find a partition 
refinement (W v , ~ V ) V £T of t(T) of width at most n. 

Proof. Let (x,S(x), (^Pp r (x))r, (<Pim(, x ,y))i<r) De the definition scheme of r, and let h be 
the maximal rank of these formulae. 

Given % we define the desired partition refinement 77 = (W u , ~ u )«gt by setting 

W u := {x G Lf(T) | u < x}, 

and x^ u y : iff MTh h (T„ , x) = MTh h , y) , 

where v, w are the successors of u with x G PV„ and y G . 

(If n is a leaf of T then VF n = {x} and we take the equality relation for Note that the 
index of ~„ is finite and that it only depends on h and not on the input tree T. 

It remains to show that 77 is actually a partition refinement. First, let us prove that 
x ~ v y and u <v implies x ~ u y. It is sufficient to consider the case that u is the predecessor 
of v. Then the general case follows by induction. Hence, suppose that v is a successor of u, 
that w, w' are successors of v, and that x, y are leaves with w ^ x and w' < y such that 
x m v y. Then we have 

MTh h (<Z w ,x) = MTh h (Z w ,,y), 
which, by Lemma |3.14[ implies that 

MTh h (X„,x) = MTh /l (T„, J/ ). 

Consequently, we have x ~ u y. 

We also have to show that the incidence relation is invariant under ~ u . Let v,w be 
successors of u and suppose that x, y are leaves with v < x and u> < y such that a; ~ u y. 
We distinguish two cases. 

Suppose that x, y G A and let e G 75 x (W„ U W w ) be an edge. Since 

MTh h (T v ,x) = MTh h (1 w ,y) , 

it follows that 

£ H Vinifae) iff T^(^ ini (y,e). 

Hence, (x,e) G in^ iff (y, e) G inj. 

Now, suppose that x,y £ E and let £ G A \ (H 7 ^ U W w ) be an element. Since 

MTh h (1 v ,x) = MTh h (1 w ,y), 

it follows that 

%\= (pim(z,x) iff 1 |= (^ in .(2;,y) . 
Hence, (z,x) G in, iff (z,y) G inj. □ 
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Proof of Theorem 13 (1) First, suppose that r : TMEE m STMinfr] is a basic MSO- 
transduction such that, for every m-coloured order-tree T G dom(r) with image 2lj n G t(X), 
we have 

A U E = Lf (1) and A n £? = . 

It follows by Lemma [5 , 1 1 that there is a number toeN such that, for every tree X, we can 
find a partition refinement of 2lj n G r(T) with underlying tree X whose width is at most w. 
By Lemma 15.91 it follows that 21 has a tree decomposition (U v ) v( zt with underlying tree T 
and whose width is less than k := w(r + 3). 

(2) If r is a basic MSO-transduction such that 

4 U E C Lf (T) and 4 n £ = , for all 2l in € r(T) with T G dom(r) , 

then we can argue similarly Let r' be the MSO-transduction mapping T to the structure 
obtained from r(T) by adding one isolated element for every leaf of X that does not corre- 
spond to an element of t(T). Then r' is of the form considered in (1) and we obtain a tree 
decomposition (U v ) v ^t 01 T '(^)- Deleting from every component U v all elements not in t(T) 
we obtain the desired tree decomposition of t(T). 

(3) Suppose that r is a non-copying MSO-transduction as in (2) but with p parameters. 
We can regard r as a basic MSO-transduction TM£E m+; , — > STRj n [i7]. By (2) it follows that, 
for every value of the parameters P, the structure r(T, P) has a tree decomposition of the 
required form. 

(4) Finally, consider the general case. Suppose that r is Z-copying. Given % let T + be 
the tree obtained from % by adding I new successors to every vertex of T. Formally, suppose 
that T C D <UJ , for some finite set D. W.l.o.g. we may assume that D n [/] = 0. We define 
the domain T+ C(flU [1]) <U} of 1+ by 

T+ := TU (T x [/]) . 

Furthermore, we add new colour predicates 

Si := T x {i} , for i G [I] . 

Note that every element of t(T) is of the form (v, i) where i G [I] and v G T. Hence, each 
such element corresponds to a leaf m G T X [I] Q T + . Using the parameters S we can 
construct a basic MSO-transduction r + : TMEE m+ ; — > STMi n [U] satisfying the conditions 
of (3) such that r + (T + ) = r(T). By (3), we obtain a tree decomposition H + = (U^~) v& x+ 
of r + (T + ) = t(T). Let H = {U v ) v< zt be the tree decomposition obtained from H + by 
contracting every edge leading to a leaf in T + \ T. Then we have 

wd(F) + 1 < (I + l)(wd(#+) + 1) 

and wd(H + ) < w , for some ioGN independent of T. □ 

6. The transduction hierarchy 

The focus of our investigation lies on the following preorder on classes of structures 
which compares their 'encoding powers' with respect to MSO-transductions. Our main 
result is a complete description of the hierarchy induced by this preorder. It will be given 
in Theorem 16.41 

Definition 6.1. Let C,1C C STIR. We define the following relations. 
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(a) C C /C if there exists a transduction r such that C C t(/C). 

(b) C C /C if C C /C and £ £ C. 

(c) C = /C if C C /C and /C C C. 

(d) C <d /C if C C /C and there is no class £> with C C P (Z /C. 

(e) C C in /C if C in C/C in . 

(f) The relations C m , = m , and <i n are defined analogously to C, =, < by replacing C 
everywhere by C m . 

The transduction hierarchy is the hierarchy of classes C C STIR induced by the relation rZj n . 

As the class of transductions is closed under composition, it follows that the relation C; n 
is a preorder, i.e., it is reflexive and transitive. 

Lemma 6.2. C m is a preorder on ^(STR). 

Definition 6.3. We consider the following subclasses of STR[{edg}]. (All trees below are 
considered to be successor-trees.) 

(a) 7~ n '■= { m <n | m € N } is the set of all complete m-ary trees of height n. 

(b) 7bin is the class of all binary trees. 

(c) %j is the class of all trees. 

(d) V is the class of all paths. 

(e) Q is the class of all rectangular grids. 

The following description of the transduction hierarchy is the main result of the present 
paper. 

Theorem 6.4. We have the following hierarchy: 

<in % <in 71 <in • • • <in %, <in ' ' • Ci n V <i n % =in 7bin <in Q 

For every signature £, every class C C STMfZ 1 ] is = m - equivalent to some class in this 
hierarchy. 

Remark 6.5. There is a lot of flexibility in the choice of representatives for the various 
levels. For instance, we could replace T n by the class of all trees of height at most n, T u by 
Tbin or { 2 <n | n € N }, and Q by the class of square grids. 

It is straightforward to show that the above classes form an increasing chain. The hard 
part is to prove that the chain is strictly increasing and that there are no further classes. 

Lemma 6.6. We have 

Ein To !=in 7l Ein ' ' ' Ein T Ein ' ' ' Ein *P Ein T C=in Q ■ 

Proof. In the example before Lemma 13.71 we have constructed transductions r n such that 
T n T n (V). Hence, T n Em T. The remaining assertions follow from the observation that, 
by Lemma 15.11 C Q Min(/C) implies C Ci n fC. □ 

Let us collect some easy properties of the hierarchy. Our first result states that Q is a 
representative of the top level of the transduction hierarchy. 

Lemma 6.7. STIRfZ 1 ] C in Q 

Proof. Recall that the m x n grid is the undirected graph & = {V, edg) with vertices 
V = [m] x [n] and edge relation 

edg = {((i,k),(j,l)) | \i-j\ + \k-l\ = 1}. 
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Before encoding arbitrary structures in such grids we describe a transduction mapping (8 to 
its directed variant (V,Eq,Ei) where 

E := { ({i, k), (i + 1, k)) \ % < m - 1, k <n} , 

and E\ := { ((i, k), (i, k + 1)) | i < m, k < n — 1 } . 

This can be done with the help of the parameters Pq, P\, P2, Qo, Qi, Q2 Q V where 

Pm '■= { (*j k) \ i = m (mod 3) } , 

and Q m := {(i,k) \ k = m (mod 3) } . 

Then 

Eq = { (it, v) G edg I u G Pi and v G Pj for some i = j — 1 (mod 3) } , 

and Ei = { (u, v) € edg | u G Qi and t> G Qj for some i = j — 1 (mod 3) } . 

It is easy to write down a formula checking that the parameters P m and Q m are correctly 
chosen (see, e.g., |Cou97j). 

To show that STM[i7] Cj n Q, suppose that 21 G STIR [17] is a structure with 2li n = 
(A U -E 1 , (Pr)r, ino, . . . , in r _i). Fix enumerations ao, • • • , a m -i of ^4 and eo, • • • , e n _i of 
By the above remarks, it is sufficient to encode 2lj n in the directed mx n grid. Consider the 
following subsets of [m] x [n] : 

A' := [m] x {0} , := { (0, k) \ e k G P R } , 

^' := {0} x [n] , I/ := { (*, fe) | (a,, e fc ) G in/ } . 

Then 2t; n can be recovered from & by an MSO-transduction using these sets as parameters. □ 

Lemma 6.8. % =m % m - 

Proof. For one direction, note that 7bin Q %j implies 7bi n Qm 7u- Conversely, each finite 
tree can be obtained as minor of a binary tree. Therefore, we have T w C Min(Tbin) Ein 7bi n . 

□ 

Lemma 6.9. We have C =i n 71 if and only ifC is finite and contains at least one nonempty 
structure. 

As indicated in the example before Lemma [3.71 there exists a transduction mapping an 
incidence structure 2li n to the incidence structure Gf(2l)j n of the Gaifman graph of 21. 

Lemma 6.10. For every class C of structures, we have Gf(C) C; n C . 

The next result is just a restatement of Lemma 15.11 in our current terminology. 

Lemma 6.11. For every class C of graphs we have Min(C) = m C . 

7. Strictness of the hierarchy 

In this section we prove that the hierarchy is strict. Using the results of Section [5] we 
can characterise each level of the hierarchy in terms of tree-width and its variants. 

Theorem 7.1. Let C C STM[Z]. 

(a) C Em V iff pwd(C) < 00. 

(b) C Em % iff twd(C) < 00. 

(c) C Ei n T n iff twd n (C) < 00. 

Proof. In each case (■<=) follows from Lemma [5. 21 and (=>■) follows from Theorem 15.51 □ 
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Corollary 7.2. Let C be a class of U- structures. 

(a) pwd(C) = oo implies Ti, C in C. 

(b) twd(C) = oo implies Q C in C. 

Proof, (a) Suppose that pwd(C) = oo. Then Theorem 14.51 implies that T w C Min(Gf(C)). 
Hence, the claim follows from Lemmas 16.101 and 16.111 

(b) Suppose that twd(C) = oo. Then Theorem 14.61 implies that Q C Min(Gf(C)). As 
in (a), the claim follows from Lemmas 16.101 and 16.111 □ 

Corollary 7.3. Let C C STR[27|. 

(a) C V implies %j Q m C. 

(b) C gi n % implies Q = m C. 

In particular, it follows that the upper part of the hierarchy is strict: 
Corollary 7.4. V <- m < in G 

Proof. Both assertions follow from Theorem 17.11 and Corollary 17.31 

For the first one, note that we have V C in T w since V C Min(7L)- Conversely, pwd(7L) = 
oo implies, by Theorem 17.11 (a). that %j gin T- Hence, V Ci n 7L- Finally, if C C; n T w then 
C Ci n V, by Corollary 17.31 (a). Consequently, we have V <\- m 7L- 

Similarly, the fact that T u Ci n Q follows from Lemma 16.71 Since twd(^) = oo, Theo- 
rem ET](b) implies that %j C; n Q- Finally, we obtain %j <l; n Q by Corollary 17.31 (b). d 

Let us turn to the lower part. We start with two technical lemmas. 

Definition 7.5. Let T = (T, <) be an order-tree. Vertices vo, . . . , v m -\ of T are horizontally 
related via a vertex w if all v j are at the same level of the tree and Vi n Vk = w, for all 
< i < k < m. 

Lemma 7.6. Let X be a coloured order-tree of height n, and suppose that r is a parameterless 
k-copying MSO -transduction of rank r such that r(T) is a successor-tree of height at most 
n + 1. Consider vertices vq, . . . ,f m -i of % that are horizontally related via w and fix some 
number I < k. Let Xi be the successor of w with Xi -< V{. If, for all i,j < m, we have 

MTh r+2 n+i(T a;i ,Wi) = MTh r+ 2„+i(T Xj ,t;j) , 

then at least m — 1 elements of the set {{vq,1), . . . , (v m ~i,l)} (these are elements of the 
domain o/r(T)) are horizontally related in t(T). 

Proof. Let (p ss /(x, y) be the formula defining the successor relation in t(T) between vertices 
of the form (x, s) and (y, s'). By assumption the rank of ip ss /(x, y) is at most r. 

First, note that a vertex {v,l) is on level h in t(T) if and only if there are indices 
So, ••• , Sh-i < k such that 

where the formula 

i<h-l 

A \J (p sso (z,y ) 

s<k 
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expresses that there exists a path of the form (yo, so), ■ ■ ■ , {Vh—X-, s h-i), (v, I) from the root 
(yo> s o) °f T (^) to the vertex (v, I). By assumption on Vi and Lemma 13,141 we have 

+2n+l(% Vi) = MTh r . + 2n+l('^, Vj) , for all i,j . 

Since the rank of ip S o—sh-i 1S h + r + 1 < r + 2n + 1 it follows that 

Jh^o.^-iW iff £h^*o...«h-i( u i)- 

Hence, all vertices (vo, /},..., (i> m --i, I) are on the same level h in t(T). We prove by induction 
on h that 

(*) MTh r+n+h+1 (1 Xi ,Vi) = MTh r+n+h+1 (1 Xj ,Vj) 

implies that all but at most one of (vo, /},..., (v m -x, I) are horizontally related. 
Let (ui,Si) be the predecessor of (v^l) in r(T), that is, 

We distinguish two cases. 

First suppose that uq F\ vq ■< w in %. By (*) and Lemma [3.141 we have 

MTrv+n+ZH-iC^o^o) = MTh r+n+/l+1 (T,u ,^) , 

for all i such that uo n ^ ^ w. Note that there can be at most one index i that does not 
satisfy this condition since, if we had uq n Vi >z w and uq n Vj y w, for i ^ j, then we would 
have Vi HVj y w and vq, . . . , i> m -i would not be horizontally related via w. It follows that 

T \= (p so l(uo,Vo) implies T |= (p so l(uo,Vi) , for all i as above. 

Hence, (uq,So) is the common predecessor of all the (vi,l), except for possibly one of them. 
(For an index i with no l~1 v% y w our composition argument does not work since in that case 
(*) does not imply that the theories of (%, Uq,Vo) and (%, UQ,Vi) coincide.) 
It remains to consider the case that w -< uq n vq. Setting 

rj Ui := /\MTh r+n+h - 1+ x('X x .,Ui) 

we have 

% Xo \= 3z[\z\ = \u \ Aj)„ (z) A<p so i(z,v )) . 

Since the rank of this formula isr + n + h + 1 it follows that 

% Xi |= 3z[\z\ = \u \ A rj Uo (z) A <p so i(z,Vi)] , for alH < m. 

Consequently, we have \u{\ = \uq\, for all i, and uq, . . . ,u m —x are horizontally related via w. 
Since the vertices {uq, Sq), . . . , (u m _i, So) are on level h— 1 in r(T), we can apply the induction 
hypothesis and it follows that all but at most one of then are horizontally related via some 
vertex to'. Therefore, the same holds for their successors {vq, /},..., (v m -x, I)- O 
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Definition 7.7. We denote by B(n,k,c) the number of functions m <n — > &([c}) with 
m < k. Intuitively, each such function corresponds to a vertex-colouring of the tree m <n 
with c colours. 

Lemma 7.8. For n > 1 and k > 2, we have 

2 cfc " _1 < B{n,k,c) < k2 2ck "~\ 

Proof. For m > 2, we have 

. , m n_1 — 1 



< m n - 1 + V m* = m"" 1 + — < 2m r > 

. ^-^ m — 1 



i<n— 1 

,n-l 



Since | [m] <n | = X^i<n m * = mn + Yl%<n-l m% ^ follows that 

m 11 - 1 < \ [m] <n \ < 2m! 1 - 1 . 



Therefore, we can bound 



B(n,k,c) =2 cn +J2 2 



c\[ra}< 

m=2 



from above by 



and from below by 



B(n, k,c) <2 cn +Y^ 2 c2m " 1 < k2 2ch " ' 

m=2 



-> n ~ 1 - ocfc" -1 



□ 



B{n,k,c) >2 cn +Y, 2 cm " 1 > 2 

m=2 

Theorem 7.9. 7^ C; n %+i 

Proof. For a contradiction, suppose that there exists a transduction r such that (7^+i)i n C 
r((7n)in)- Let T^ 3 ^ be the class of all order-trees corresponding to successor-trees in T n , and 
let 7^°\ := exp 1 (7^+i) be the class of all coloured successor-trees with one colour whose 
underlying tree is in T n +i- Since the successor-trees in l~ n are 1-sparse we can construct an 
MSO-transduction <To such that (7^)i n ^ ^o(%i)- Since 7^ rd = 7^ we can combine r and o"o 
to a transduction a such that 7^°i C o~(7£ ). By Lemma 17. 6| it follows that there is 
some constant c? such that every tree X G cr ('7^ )rd ) with out-degree at most k is of the form 
cr(X') where X' E 7^ rd has out-degree at most dk. (The out-degree of an order-tree is the 
out-degree of the corresponding successor-tree.) Suppose that a uses c parameters. There 
are 

B{n,dk,c) < dk2 2 < dk ^~ 1 

colourings of trees in 7^ rd with out-degree at most dk. On the other hand, there are 

B(n + l,k,l) > 2 fc ™ 

trees in 7S\ with out-degree at most k. For large k it follows that 

B(n, dk, c) < dk2 2cdn ~ lkn ~ 1 < 2 kn = B(n + 1, k, 1) . 

Consequently, there is some tree in 7^?\ that is not the image of a tree in 7^ rd . A contra- 
diction. □ 
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8. Completeness of the hierarchy 

We have shown that the hierarchy presented in Theorem 16.41 is strict. To conclude the 
proof of the theorem it therefore remains to show that there are no additional classes. We 
have already seen in Corollary 17.41 that %j and Q are the only classes above V . Next we 
shall prove that T n < Tn+l- 

Lemma 8.1. Let C be a class of structures. If for every number m 6 N, there exists a struc- 
ture 21 € C such that we can embed m <n+1 into every tree underlying a tree decomposition 
(U v ) v< =t o/2l of width k, then T n +i Ein C. 

Proof. By Lemma I4.15[ it follows that, for every m 6 N, there is a structure in C with a 
strict tree decomposition of width at most k and with an underlying tree T into which we 
can embed the tree m <n+1 . According to Proposition 15.31 there is a transduction mapping C 
to the class of trees underlying these strict tree decompositions. Hence, there exists a class 
Ein C °f successor-trees containing, for every m 6 N, some tree into which we can embed 
m <n+1 . Hence, T n +i C Min(/C) Ein K Ein C. □ 

Theorem 8.2. Let C be a class of structures. Then T n +i 2in C implies twd n (C) < oo. 

Proof. Suppose that 7^+i [2m C. Then V ^ m C since 7~ n +i Ein V- By Lemmas l6.1Ul and l6.11[ 
this implies that V (£. Min(Gf(C)). Therefore, we can find a path that is not in Min(Gf(C)). 
By Theorem 14.91 it follows that there are numbers k, I G N such that twdj(C) < k. 

By induction on I, we prove that twdj(C) < oo implies twd„,(C) < oo. For I < n, there 
is nothing to do. For I > n, we have 7~n+i Ein 7j, which implies that 71 l^m ^- Consequently, 
it follows by Lemma 18.11 and Corollary 14.131 that twd/_i(C) < oo. By induction hypothesis, 
the result follows. □ 

By Lemma [5.21 we obtain the following results. 

Corollary 8.3. IfT n +\ Em C then C Ein T n . 

Corollary 8.4. T n <in 

To conclude the proof of Theorem 16.41 it remains to show that there are no classes 
between the lower part of the hierarchy and its upper part. 

Lemma 8.5. Let C be a class of structures. If T n Ein C, for all n £N, then V C in C. 

Proof. We show the contraposition. Suppose that V ^in C We have to show that there is 
some n such that T n Ein C- As in the proof of Theorem 18.21 it follows that there are numbers 
k, I € N such that twdj(C) < k. Hence, we can use Lemma 15.21 to obtain a transduction r 
witnessing that C Ein Ti- By Theorem 17.91 we have 7I+i [2m Ti- It follows that 71+ 1 [2in C, 
as desired. □ 

Corollary 8.6. If C Ci n V then there is some n G N such that C Em Tn- 

Proof. By Lemma 18.51 there is some n € N such that 7^+i 2in C. Hence, Corollary 18.31 
implies that C Ein T n - D 

Together, Corollaries EU KM and EM (and the fact that every class C satisfies [Hj n 
C Qi n Q) show that every class of ^-structures is =i n -ec|uivalent to some of the classes in 
Theorem 16.41 This completes the proof of this theorem. 
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9. Prospects and conclusion 

Above we have obtained a complete description of the transduction hierarchy for classes 
of finite incidence structures. The most surprising result is that the hierarchy is linear. At 
this point there are at least three natural directions in which to proceed. 

(i) We can study the hierarchy for classes of structures, instead of classes of incidence 
structures. 

(ii) We can consider the hierarchy for classes of infinite structures. 

(iii) We can replace MSO by a different logic. 

An answer to (ii) seems within reach, at least if we restrict our attention to countable 
structures. Although the resulting hierarchy is no longer linear we can adapt most of our 
techniques to this setting. (For an example of nonlinearity, note that the class of all countable 
trees and the class of all finite grids are incomparable.) 

Concerning question (iii), let us remark that all results above go through if we use 
CMSO instead of MSO. We only need the right definition of rank for GMSO. In the proof of 
Theorem 17.91 we needed the fact that there are only finitely many theories of bounded rank. 
We can ensure this for CMSO by defining the rank as the least number n such that 

• the nesting depth of quantifiers is at most n and 

• in every cardinality predicate \X\ = k (mod m) we have m < n. 

One can check that, with this definition of rank, the proof of Theorem 13 . 1 3 1 also goes through 
for CMSO. 

For logics much weaker than MSO, on the other hand, it seems unrealistic to hope for 
a complete description of the corresponding transduction hierarchy. For instance, a related 
hierarchy for first-order logic was investigated by Mycielski, Pudlak, and Stern in [MPS90J. 
The results they obtain indicate that the structure of the resulting hierarchy is very compli- 
cated. 

Finally, let us address question (i). When using transductions between structures instead 
of their incidence structures, we can transfer some of the above results to the correspond- 
ing hierarchy But we presently have no complete description since we miss some of the 
corresponding excluded minor results. 

Lemma 9.1. Let C,S C STIR and suppose that S is k-sparse. 

(a) C Q{ n S implies C C S. 

(b) S QC implies S Qi n C. 

Proof. There is a transduction g such that C = g(Ci n ). Since S is fe-sparse we can also find 
a transduction a such that <Si n = cr(S). Consequently, 

Cm Q T(5i n ) implies C C (g o r o a)(S) , 

and S C t(C) implies Si n C (a o r o g)(Ci a ) . □ 

Theorem 9.2. We have the following hierarchy: 

$ n To n Ti n ■ ■ ■ n T n - ■ ■ n T n % n G = §TR[£] 

Proof. Note that all classes in Theorem 19.21 are 2-sparse. For 2-sparse classes C and /C, 
Lemma [9 . 1 1 implies that 

CC/C iff C C in K . 
Consequently, the result follows from Theorem 16.41 □ 
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Open Problem 9.3. Is there any class C C §TIR[Z'] which is not =-equivalent to some 
class in the above hierarchy? 

Remark 9.4. If we only consider classes of graphs and if we use CMSO-transductions instead 
of MSO-transductions, then the following result can be used as replacement of Theorem 14.61 : 

Theorem 9.5 ( |CO07| ). Let C be a class of graphs with unbounded clique-width. There 
exists a CMSO -transduction r with Q C t(C). 

This eliminates some possibilities for intermediate classes of graphs in the hierarchy of The- 
orem 19.21 but to complete the picture we still need analogues of Proposition 15.31 and of 
Theorems 14.51 and 14.91 Furthermore, the techniques of [CO07J are specific to graphs (or, 
more generally, to relational structures where all relations are binary). Even with the re- 
sults of | CO07| one cannot exclude the existence of a class C of arbitrary relational structures 
strictly between T w and Q in the CMSO-transduction hierarchy. 

Let us make a final comment about relational structures. An incidence structure 2l; n 
can be seen as a bipartite labelled directed graph (see the remark after Definition I2.ip . 
Furthermore, it is 1-sparse. Hence, our results use tools from graph theory, in particular 
those of |RS83, RS86, [CO07| . However, there is currently no encoding of relational structures 
as labelled graphs that could help to solve question (i) above. 
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